Multi-resolution auditory cepstral coefficient and adaptive mask for speech enhancement with deep neural network
Similar resources
Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement
We propose a multi-objective framework to learn both secondary targets not directly related to the intended task of speech enhancement (SE) and the primary target of the clean log-power spectra (LPS) features to be used directly for constructing the enhanced speech signals. In deep neural network (DNN) based SE we introduce an auxiliary structure to learn secondary continuous features, such as ...
Multi-Modal Hybrid Deep Neural Network for Speech Enhancement
Deep Neural Networks (DNN) have been successful in enhancing noisy speech signals. Enhancement is achieved by learning a nonlinear mapping function from the features of the corrupted speech signal to that of the reference clean speech signal. The quality of predicted features can be improved by providing additional side channel information that is robust to noise, such as visual cues. In this p...
Deep Neural Network Approach for Single Channel Speech Enhancement Processing
NMF-based speech enhancement incorporating deep neural network
Recently, many machine-learning algorithms have been proposed for speech enhancement. One of the best-known approaches is based on non-negative matrix factorization (NMF), which analyzes noisy speech with speech and noise bases. However, NMF-based algorithms have difficulties in estimating speech and noise encoding vectors when their subspaces overlap. In t...
Subjective Intelligibility of Deep Neural Network-Based Speech Enhancement
Recent literature indicates increasing interest in deep neural networks for use in speech enhancement systems. Currently, these systems are mostly evaluated through objective measures of speech quality and/or intelligibility. Subjective intelligibility evaluations of these systems have so far not been reported. In this paper we report the results of a speech recognition test with 15 participant...
Journal
Journal title: EURASIP Journal on Advances in Signal Processing
Year: 2019
ISSN: 1687-6180
DOI: 10.1186/s13634-019-0618-4